# Lightweight Language Models
## MiniLLM 0.2B WithWudao

- **Author:** Tongjilibo
- **License:** Apache-2.0
- **Tags:** Large Language Model, Transformers

MiniLLM is a lightweight Chinese language model built on the bert4torch framework. It covers the full pipeline from pre-training to instruction fine-tuning and offers basic dialogue capabilities.
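
A minimal loading sketch, assuming the checkpoint is published on the Hugging Face Hub under a repo id like `Tongjilibo/MiniLLM-0.2B-WithWudao` (the id and the `trust_remote_code` path are assumptions; the authors train and serve the model with bert4torch):

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "Tongjilibo/MiniLLM-0.2B-WithWudao"  # assumed repo id
tok = AutoTokenizer.from_pretrained(name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(name, trust_remote_code=True)

ids = tok("你好", return_tensors="pt").input_ids
out = model.generate(ids, max_new_tokens=32)
print(tok.decode(out[0], skip_special_tokens=True))
```
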
## Latent Recurrent Depth LM

- **Author:** codewithdark
- **License:** MIT
- **Tags:** Large Language Model, Transformers, English

An experimental text-generation architecture that captures deeper contextual information through iterative latent processing.
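
A toy PyTorch sketch of the general idea: one weight-shared transformer layer applied several times, so "depth" comes from recurrence over a latent state rather than from stacking distinct layers. This is a conceptual illustration, not the repository's actual implementation:

```python
import torch
import torch.nn as nn

class RecurrentDepthBlock(nn.Module):
    """One shared transformer layer applied n_iters times to the same
    latent state -- depth via recurrence instead of extra layers."""
    def __init__(self, d_model=256, n_heads=4, n_iters=4):
        super().__init__()
        self.layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.n_iters = n_iters

    def forward(self, x):
        latent = x
        for _ in range(self.n_iters):    # iterative latent refinement
            latent = self.layer(latent)  # same weights on every pass
        return latent

h = torch.randn(2, 16, 256)              # (batch, seq, d_model)
print(RecurrentDepthBlock()(h).shape)     # torch.Size([2, 16, 256])
```
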
## MiniPLM Qwen 200M

- **Author:** MiniLLM
- **License:** Apache-2.0
- **Tags:** Large Language Model, Transformers, English

A 200M-parameter model based on the Qwen architecture, pre-trained from scratch using the MiniPLM knowledge-distillation framework.
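
For flavor, a sketch of the classic logit-matching distillation objective (Hinton-style KD). This is the generic formulation only; MiniPLM's actual framework uses the teacher differently during pre-training:

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, T=2.0):
    """KL divergence between temperature-softened teacher and student
    distributions, scaled by T^2 so gradient magnitudes stay comparable."""
    s = F.log_softmax(student_logits / T, dim=-1)
    t = F.softmax(teacher_logits / T, dim=-1)
    return F.kl_div(s, t, reduction="batchmean") * T * T
```
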
## Meta Llama 3.1 8B Instruct Abliterated GGUF

- **Author:** ZeroWw
- **License:** MIT
- **Tags:** Large Language Model, English

A text-generation model using mixed quantization: the output and embedding tensors are kept in f16, while the remaining tensors are quantized to q5_k or q6_k. The result is smaller than the standard q8_0 format while performing comparably to the pure f16 version.
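
A minimal sketch for running a GGUF file locally with llama-cpp-python; the file name below is an assumption (use whichever quantized `.gguf` you downloaded):

```python
from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(
    model_path="Meta-Llama-3.1-8B-Instruct-abliterated.q5_k.gguf",  # assumed file name
    n_ctx=4096,
)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```
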
## Mamba 3B Slimpj

- **Author:** Q-bert
- **License:** Apache-2.0
- **Tags:** Large Language Model, Transformers, English

A 3B-parameter language model based on the Mamba selective state-space architecture, supporting English text-generation tasks.
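
A hedged sketch using the mamba-ssm package (CUDA required); the repo id and the GPT-NeoX tokenizer choice are assumptions, not confirmed from the card:

```python
import torch
from transformers import AutoTokenizer
from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel

tok = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")      # assumed tokenizer
model = MambaLMHeadModel.from_pretrained("Q-bert/Mamba-3B-slimpj",  # assumed repo id
                                         device="cuda", dtype=torch.float16)

ids = tok("The Mamba architecture", return_tensors="pt").input_ids.to("cuda")
out = model.generate(ids, max_length=64)  # returns generated token ids
print(tok.decode(out[0]))
```
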
## Llama2 XS 460M Experimental

- **Author:** ahxt
- **Tags:** Large Language Model, Transformers, English

This series of repositories open-sources reproductions of Meta AI's LLaMA and LLaMA 2 large language models at significantly reduced sizes: the llama1_s experimental version contains 1.8 billion parameters, while the llama2_xs experimental version has only 460 million.
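
A minimal generation sketch, assuming the repo id `ahxt/llama2_xs_460M_experimental`:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "ahxt/llama2_xs_460M_experimental"  # assumed repo id
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

ids = tok("The capital of France is", return_tensors="pt").input_ids
print(tok.decode(model.generate(ids, max_new_tokens=20)[0]))
```
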
## GPT2023

- **Author:** crumb
- **License:** MIT
- **Tags:** Large Language Model, Transformers, English

A 124M-parameter language model based on the GPT-2 architecture, fine-tuned on 2.23B tokens of diverse data for improved text-generation quality.
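
Since this appears to be a standard GPT-2-compatible checkpoint, the stock text-generation pipeline should suffice (repo id assumed):

```python
from transformers import pipeline

gen = pipeline("text-generation", model="crumb/gpt2023")  # assumed repo id
print(gen("In 2023, language models", max_new_tokens=30)[0]["generated_text"])
```
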
## Japanese GPT-NeoX Small

- **Author:** rinna
- **License:** MIT
- **Tags:** Large Language Model, Transformers, Multilingual

A small Japanese language model based on the GPT-NeoX architecture, supporting text-generation tasks.
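
A minimal sketch, assuming the repo id `rinna/japanese-gpt-neox-small`; rinna checkpoints typically ship a SentencePiece tokenizer, hence `use_fast=False`:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "rinna/japanese-gpt-neox-small"  # assumed repo id
tok = AutoTokenizer.from_pretrained(name, use_fast=False)
model = AutoModelForCausalLM.from_pretrained(name)

ids = tok("こんにちは、", return_tensors="pt").input_ids
out = model.generate(ids, max_new_tokens=30, do_sample=True)
print(tok.decode(out[0]))
```
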
## ALBERT Base Japanese V1 With Japanese Tokenizer

- **Author:** ken11
- **License:** MIT
- **Tags:** Large Language Model, Transformers, Japanese

A Japanese-pretrained ALBERT model that uses BertJapaneseTokenizer as its tokenizer, which makes Japanese text processing more convenient.
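
A fill-mask sketch, assuming the repo id `ken11/albert-base-japanese-v1-with-japanese-tokenizer`; note that BertJapaneseTokenizer requires the fugashi and unidic-lite packages:

```python
from transformers import AlbertForMaskedLM, BertJapaneseTokenizer, pipeline

name = "ken11/albert-base-japanese-v1-with-japanese-tokenizer"  # assumed repo id
tok = BertJapaneseTokenizer.from_pretrained(name)
model = AlbertForMaskedLM.from_pretrained(name)

fill = pipeline("fill-mask", model=model, tokenizer=tok)
print(fill(f"東京は日本の{tok.mask_token}です。")[0])
```
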
## mMiniLMv2 L6 H384 Distilled From XLM-R Large

- **Author:** nreimers
- **Tags:** Large Language Model, Transformers

MiniLMv2 is a lightweight language-representation model developed by Microsoft that achieves efficient performance through knowledge distillation. This multilingual variant (6 layers, hidden size 384) is distilled from XLM-R Large.
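
Distilled encoders like this are commonly used for sentence embeddings or as fine-tuning backbones. A mean-pooling embedding sketch (repo id assumed):

```python
import torch
from transformers import AutoTokenizer, AutoModel

name = "nreimers/mMiniLMv2-L6-H384-distilled-from-XLMR-Large"  # assumed repo id
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name)

batch = tok(["Hello world", "Bonjour le monde"], padding=True, return_tensors="pt")
with torch.no_grad():
    hidden = model(**batch).last_hidden_state      # (batch, seq, 384)
mask = batch.attention_mask.unsqueeze(-1)
emb = (hidden * mask).sum(1) / mask.sum(1)         # average over real tokens only
print(emb.shape)                                   # torch.Size([2, 384])
```
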
## BERT L12 H384 A6

- **Author:** eli4s
- **Tags:** Large Language Model, Transformers

A lightweight BERT model pre-trained on the BookCorpus dataset via knowledge distillation, with the hidden dimension reduced to 384 and 6 attention heads.
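
The stated geometry can be reproduced with a plain BertConfig; the intermediate size below is an assumption (4x hidden, the usual BERT ratio):

```python
from transformers import BertConfig, BertForMaskedLM

cfg = BertConfig(
    num_hidden_layers=12,
    hidden_size=384,
    num_attention_heads=6,   # head dim = 384 / 6 = 64
    intermediate_size=1536,  # assumed: 4 x hidden_size
)
model = BertForMaskedLM(cfg)
print(f"{sum(p.numel() for p in model.parameters()) / 1e6:.1f}M parameters")
```
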
## DistilBERT Base Uncased Sparse 90% Unstructured PruneOFA

- **Author:** Intel
- **License:** Apache-2.0
- **Tags:** Large Language Model, Transformers, English

A sparse pre-trained model with 90% weight sparsity obtained through one-shot unstructured pruning (Prune Once for All), suitable for fine-tuning on a range of language tasks.
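
A sketch that loads the checkpoint and measures the weight sparsity directly (repo id assumed; the exact figure depends on which tensors were pruned):

```python
from transformers import AutoModelForMaskedLM

model = AutoModelForMaskedLM.from_pretrained(
    "Intel/distilbert-base-uncased-sparse-90-unstructured-pruneofa")  # assumed repo id

zeros = total = 0
for name, p in model.named_parameters():
    if "weight" in name and p.dim() == 2:  # linear / embedding matrices
        zeros += (p == 0).sum().item()
        total += p.numel()
print(f"overall weight sparsity: {zeros / total:.1%}")
```
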